Major Open Source Release! Native Multimodal LongCat-Next Released, Making Vision and Speech the Mother Tongue of AI
Global AI is undergoing an 'AI-native' technological shift. Addressing the current 'language-centric, externally patched vision or speech' architecture, a team released and open-sourced the native multimodal large model LongCat-Next and a discrete tokenizer, aiming to break modal barriers and enable AI to understand the physical world like processing text, achieved by reconstructing the underlying architecture.....